Search CORE

167 research outputs found

A structural study for the optimisation of functional motifs encoded in protein sequences

Author: Helmer-Citterich Manuela
Via Allegra
Publication venue: BioMed Central
Publication date: 01/01/2004
Field of study

BACKGROUND: A large number of PROSITE patterns select false positives and/or miss known true positives. It is possible that – at least in some cases – the weak specificity and/or sensitivity of a pattern is due to the fact that one, or maybe more, functional and/or structural key residues are not represented in the pattern. Multiple sequence alignments are commonly used to build functional sequence patterns. If residues structurally conserved in proteins sharing a function cannot be aligned in a multiple sequence alignment, they are likely to be missed in a standard pattern construction procedure. RESULTS: Here we present a new procedure aimed at improving the sensitivity and/ or specificity of poorly-performing patterns. The procedure can be summarised as follows: 1. residues structurally conserved in different proteins, that are true positives for a pattern, are identified by means of a computational technique and by visual inspection. 2. the sequence positions of the structurally conserved residues falling outside the pattern are used to build extended sequence patterns. 3. the extended patterns are optimised on the SWISS-PROT database for their sensitivity and specificity. The method was applied to eight PROSITE patterns. Whenever structurally conserved residues are found in the surface region close to the pattern (seven out of eight cases), the addition of information inferred from structural analysis is shown to improve pattern selectivity and in some cases selectivity and sensitivity as well. In some of the cases considered the procedure allowed the identification of functionally interesting residues, whose biological role is also discussed. CONCLUSION: Our method can be applied to any type of functional motif or pattern (not only PROSITE ones) which is not able to select all and only the true positive hits and for which at least two true positive structures are available. The computational technique for the identification of structurally conserved residues is already available on request and will be soon accessible on our web server. The procedure is intended for the use of pattern database curators and of scientists interested in a specific protein family for which no specific or selective patterns are yet available

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

ART

Archivio della ricerca- Università di Roma La Sapienza

Query3d: a new method for high-throughput analysis of functional residues in protein structures

Author: Ausiello Gabriele
Helmer-Citterich Manuela
Via Allegra
Publication venue: BioMed Central
Publication date: 01/01/2005
Field of study

BACKGROUND: The identification of local similarities between two protein structures can provide clues of a common function. Many different methods exist for searching for similar subsets of residues in proteins of known structure. However, the lack of functional and structural information on single residues, together with the low level of integration of this information in comparison methods, is a limitation that prevents these methods from being fully exploited in high-throughput analyses. RESULTS: Here we describe Query3d, a program that is both a structural DBMS (Database Management System) and a local comparison method. The method conserves a copy of all the residues of the Protein Data Bank annotated with a variety of functional and structural information. New annotations can be easily added from a variety of methods and known databases. The algorithm makes it possible to create complex queries based on the residues' function and then to compare only subsets of the selected residues. Functional information is also essential to speed up the comparison and the analysis of the results. CONCLUSION: With Query3d, users can easily obtain statistics on how many and which residues share certain properties in all proteins of known structure. At the same time, the method also finds their structural neighbours in the whole PDB. Programs and data can be accessed through the PdbFun web interface

Crossref

Springer - Publisher Connector

PubMed Central

ART

Archivio della ricerca- Università di Roma La Sapienza

The Bioinformatics Italian Society

Author: Manuela Helmer-Citterich
Paolo Romano
Publication venue
Publication date: 29/04/2012
Field of study

The Bioinformatics Italian Society (BITS) is a non-profit scientific association grounded on 19 June 2003, to gather scientists with interests in the field of Bioinformatics, intended as multidisciplinary science studying biological problems at the molecular level by using informatics and computational methods. The Society has now about 230 members and aims at overcoming 250 in 2012

Open Access Repository

Bioinformatics in Italy: BITS2006, the third annual meeting of the Italian Society of Bioinformatics

Author: Graziano Pesole
Manuela Helmer-Citterich
Rita Casadio
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref

Dissecting the Genome for Drug Response Prediction

Author: Carrino Chiara
Helmer-Citterich Manuela
Parca Luca
Pepe Gerardo
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

The prediction of the cancer cell lines sensitivity to a specific treatment is one of the current challenges in precision medicine. With omics and pharmacogenomics data being available for over 1000 cancer cell lines, several machine learning and deep learning algorithms have been proposed for drug sensitivity prediction. However, deciding which omics data to use and which computational methods can efficiently incorporate data from different sources is the challenge which several research groups are working on. In this review, we summarize recent advances in the representative computational methods that have been developed in the last 2 years on three public datasets: COSMIC, CCLE, NCI-60. These methods aim to improve the prediction of the cancer cell lines sensitivity to a given treatment by incorporating drug's chemical information in the input or using a priori feature selection. Finally, we discuss the latest published method which aims to improve the prediction of clinical drug response of real patients starting from cancer cell line molecular profiles

ART

pdbFun: mass selection and fast comparison of annotated PDB residues

Author: Ausiello Gabriele
Helmer-Citterich Manuela
Peluso Daniele
Via Allegra
Zanzoni Andreas
Publication venue: Oxford University Press
Publication date: 01/01/2005
Field of study

pdbFun () is a web server for structural and functional analysis of proteins at the residue level. pdbFun gives fast access to the whole Protein Data Bank (PDB) organized as a database of annotated residues. The available data (features) range from solvent exposure to ligand binding ability, location in a protein cavity, secondary structure, residue type, sequence functional pattern, protein domain and catalytic activity. Users can select any residue subset (even including any number of PDB structures) by combining the available features. Selections can be used as probe and target in multiple structure comparison searches. For example a search could involve, as a query, all solvent-exposed, hydrophylic residues that are not in alpha-helices and are involved in nucleotide binding. Possible examples of targets are represented by another selection, a single structure or a dataset composed of many structures. The output is a list of aligned structural matches offered in tabular and also graphical format

SH3-Hunter: discovery of SH3 domain interaction sites in proteins

Author: Ausiello Gabriele
Ferraro Enrico
Helmer-Citterich Manuela
Peluso Daniele
Via Allegra
Publication venue: Oxford University Press
Publication date: 01/01/2007
Field of study

SH3-Hunter (http://cbm.bio.uniroma2.it/SH3-Hunter/) is a web server for the recognition of putative SH3 domain interaction sites on protein sequences. Given an input query consisting of one or more protein sequences, the server identifies peptides containing poly-proline binding motifs and associates them to a list of SH3 domains, in order to compose peptide–domain pairs. The server can accept a list of peptides and allows users to upload an input file in a proper format. An accurate selection of SH3 domains is available and users can also submit their own SH3 domain sequence

CiteSeerX

PubMed Central

ART

Archivio della ricerca- Università di Roma La Sapienza

Variation in the co-expression profile highlights a loss of miRNA-mRNA regulation in multiple cancer types

Author: Ausiello Gabriele
Helmer-Citterich Manuela
Parca Luca
Pepe Gerardo
Viviani Lorenzo
Publication venue: 'Elsevier BV'
Publication date: 01/06/2022
Field of study

Recent research provides insight into the ability of miRNA to regulate various pathways in several cancer types. Despite their involvement in the regulation of the mRNA via targeting the 3'UTR, there are relatively few studies examining the changes in these regulatory mechanisms specific to single cancer types or shared between different cancer types.We analyzed samples where both miRNA and mRNA expression had been measured and performed a thorough correlation analysis on 7494 experimentally validated human miRNA-mRNA target-gene pairs in both healthy and tumoral samples.We show how more than 90% of these miRNA-mRNA interactions show a loss of regulation in the tumoral samples compared with their healthy counterparts.As expected, we found shared miRNA-mRNA dysregulated pairs among different tumors of the same tissue. However, anatomically different cancers also share multiple dysregulated interactions, suggesting that some cancer-related mechanisms are not tumor-specific. 2865 unique miRNA-mRNA pairs were identified across 13 cancer types, approximate to 40% of these pairs showed a loss of correlation in the tumoral samples in at least 2 out of the 13 analyzed cancers. Specifically, miR-200 family, miR-155 and miR-1 were identified, based on the computational analysis described below, as the miRNAs that potentially lose the highest number of interactions across different samples (only literature-based interactions were used for this analysis).Moreover, the miR-34a/ALDH2 and miR-9/MTHFD2 pairs show a switch in their correlation between healthy and tumor kidney samples suggesting a possible change in the regulation exerted by the miRNAs. Interestingly, the expression of these mRNAs is also associated with the overall survival. The disruption of miRNA regulation on its target, therefore, suggests the possible involvement of these pairs in cell malignant functions.The analysis reported here shows how the regulation of miRNA-mRNA interactions strongly differs between healthy and tumoral cells, based on the strong correlation variation between miRNA and its target that we obtained by analyzing the expression data of healthy and tumor tissue in highly reliable miRNA-target pairs. Finally, a go term enrichment analysis shows that the critical pairs identified are involved in cellular adhesion, proliferation, and migration

ART

COTAN: scRNA-seq data analysis based on gene co-expression

Author: Cremisi Federico
Galfrè Silvia Giulia
Helmer-Citterich Manuela
Morandin Francesco
Pietrosanto Marco
Publication venue: 'Oxford University Press (OUP)'
Publication date: 01/01/2021
Field of study

Estimating the co-expression of cell identity factors in single-cell is crucial. Due to the low efficiency of scRNA-seq methodologies, sensitive computational approaches are critical to accurately infer transcription profiles in a cell population. We introduce COTAN, a statistical and computational method, to analyze the co-expression of gene pairs at single cell level, providing the foundation for single-cell gene interactome analysis. The basic idea is studying the zero UMI counts' distribution instead of focusing on positive counts; this is done with a generalized contingency tables framework. COTAN can assess the correlated or anti-correlated expression of gene pairs, providing a new correlation index with an approximate p-value for the associated test of independence. COTAN can evaluate whether single genes are differentially expressed, scoring them with a newly defined global differentiation index. Similarly to correlation network analysis, it provides ways to plot and cluster genes according to their co-expression pattern with other genes, effectively helping the study of gene interactions, becoming a new tool to identify cell-identity markers. We assayed COTAN on two neural development datasets with very promising results. COTAN is an R package that complements the traditional single cell RNA-seq analysis and it is available at https://github.com/seriph78/COTAN

Archivio istituzionale della Ricerca - Università degli Studi di Parma

ART

Phospho3D: a database of three-dimensional structures of protein phosphorylation sites

Author: Ausiello Gabriele
Gherardini Pier Federico
Helmer-Citterich Manuela
Via Allegra
Zanzoni Andreas
Publication venue: Oxford University Press
Publication date: 16/11/2006
Field of study

Phosphorylation is the most common protein post-translational modification. Phosphorylated residues (serine, threonine and tyrosine) play critical roles in the regulation of many cellular processes. Since the amount of data produced by screening assays is growing continuously, the development of computational tools for collecting and analysing experimental data has become a pivotal task for unravelling the complex network of interactions regulating eukaryotic cell life. Here we present Phospho3D, , a database of 3D structures of phosphorylation sites, which stores information retrieved from the phospho.ELM database and is enriched with structural information and annotations at the residue level. The database also collects the results of a large-scale structural comparison procedure providing clues for the identification of new putative phosphorylation sites

Archivio della ricerca- Università di Roma La Sapienza